Page 1 of 7 




OIPE 



T 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/074,596 



DATE: 03/01/2002 
TIME: 10:54:34 



3 
4 
6 
7 
9 

C--> 11 

12 
14 
15 
17 
19 
21 
22 
23 
24 
26 
27 
28 
30 
31 
33 
34 
36 
37 
39 
40 
42 
43 
45 
46 
48 
49 
51 
52 
54 
55 
57 
58 
60 
61 
63 
64 
66 



input Set : A:\Clfr007.app 
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<110> APPLICANT: ROSENBLUM, MICHAEL G. 

<120> t?t"e G 0F L ™?0N: MODIFIED PROTEINS, DESIGNER TOXINS , AND METHODS OF 

MAKING THEEOF 
<130> FILE REFERENCE: CLFR:007US 

<140> CURRENT APPLICATION NUMBER: US/10/074,596 

<141> CURRENT FILING DATE: 2002-02-12 
<150> PRIOR APPLICATION NUMBER: 60/268,402 
<151> PRIOR FILING DATE: 2001-02-12 
<160> NUMBER OF SEQ ID NOS : 11 
<170> SOFTWARE: Patentln Ver. 2.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 316 
<212> TYPE: PRT 

<213> ORGANISM: Gelonium multiflorum 

„:t 0> L ys E cS N As; Met Lys Val Tyr Tr P lie Lys lie ,1a Val Ala Thr 

1 5 10 15 

Trp Phe Cys Cys Thr Thr He Val Leu Gly Ser Thr Ala Arg lie Phe 

20 25 30 

Ser Leu Pro Thr Asn Asp Glu Glu Glu Thr Ser Lys Thr Leu Gly Leu 

35 40 45 

Asp Thr Val Ser Phe Ser Thr Lys Gly Ala Thr Tyr He Thr Tyr Val 

50 55 60 

Asn Phe Leu Asn Glu Leu Arg Val Lys Leu Lys Pro Glu Gly Asn Ser 
65 70 75 80 

His Gly lie Pro Leu Leu Arg Lys Lys Cys Asp Asp Pro Gly Lys Cys 

85 90 95 

Phe Val Leu Val Ala Leu Ser Asn Asp Asn Gly Gin Leu Ala Glu He 

100 105 HO 

Ala lie Asp Val Thr Ser Val Tyr Val Val Gly Tyr Gin Val Arg Asn 

115 120 125 

Arg ser Tyr Phe Phe Lys Asp Ala Pro Asp Ala Ala Tyr Glu Gly Leu 

130 I 35 140 

Phe Lys Asn Thr He Lys Thr Arg Leu His Phe Gly Gly Ser Tyr Pro 
145 150 155 1^0 

Ser Leu Glu Gly Glu Lys Ala Tyr Arg Glu Thr Thr Asp Leu Gly He 

165 170 175 

Glu Pro Leu Arg lie Gly He Lys Lys Leu Asp Glu Asn Ala He Asp 

180 185 I 90 

Asn Tyr Lys Pro Thr Glu He Ala Ser Ser Leu Leu Val Val He Gin 

195 200 205 

Met Val Ser Glu Ala Ala Arg Phe Thr Phe He Glu Asn Gin He Arg 
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fi7 210 215 220 

69 Asn Asn Phe Gin Gin Arg He Arg Pro Ala Asn Asn Thr He Ser Leu 

70 225 230 235 ^ 

72 Glu Asn Lys Trp Gly Lys Leu Ser Phe Gin lie Arg Thr Ser Gly Ala 

73 245 250 ^ 

75 Asn Gly Met Phe Ser Glu Ala Val Glu Leu Glu Arg Ala Asn Gly Lys 

76 260 265 270 

78 Lys Tyr Tyr Val Thr Ala Val Asp Gin Val Lys Pro Lys lie Ala Leu 

79 275 280 285 

81 Leu Lys Phe Val Asp Lys Asp Pro Lys Thr Ser Leu Ala Ala Glu Leu 

82 290 295 300 

84 He He Gin Asn Tyr Glu Ser Leu Val Gly Phe Asp 

85 305 310 315 

88 <210> SEQ ID NO: 2 

89 <211> LENGTH: 1176 

90 <212> TYPE: DNA 

91 <213> ORGANISM: Gelonium multiflorum 

S clgcttctcTcttgtttggg ataa tg aaa g gg aacat g aa ggtgtactgg attaagattg 60 

95 ct g tggc g ac atggttttgc tgcactacta ttgtaottgg atcaacggcg aggattttct 120 

96 ctct?cccac aaatgatgaa gaagaaacca gtaagacgct tggcctggac accgtgagct 180 

97 ttagcactaa aggtgccact tatattacct acgtgaattt cttgaatgag ctacgagtta 240 
aat'tgaaacc cgaaggtaac a g ccat gg aa tcccatt g ct gcgcaaaaaa tgtgatgatc 

99 ctggaaagtg tttcgttttg gtagcgcttt caaatgacaa tggacagttg 9=99«atag 360 

100 ctatagatgt tacaagtgtt tatgtggtgg gctatcaagt aagaaacaga tcttacttct 420 

101 ttaaagatgc tccagatgct gcttacgaag gcctcttcaa aaacacaatt aaaacaagac 480 

102 ttcattttgg cggcagctat ccctcgctgg aaggtgagaa ggcatataga gagacaacag 540 

103 acttaggca? tgaaccatta aggattggca tcaagaaact tgatgaaaat gcgatagaca 600 
III atta?aaacc aLggagata gcLgttctc tattggttgt tattoaaatg 9tgtctgaag 660 
105 cagctcgatt cacctttatt gagaaccaaa ttagaaataa ctttcaacag agaattcgcc 720 
10 cggcgaataa tacaatcagc cttgagaata aatggggtaa actctcgttc oagatccgga 780 

107 caLaggtgc aaatggaatg ttttcggagg cagttgaatt 99aacgtgca «tggeaaa» 40 

108 aatactatgt caccgcagtt gatcaagtaa aacccaaaat agcactcttg aagttc g tcg 9UU 

109 ataaagatcc taaaacgagc cttgctgctg aattgataat ccagaactat ^gtcattag 960 

110 tgggc?ttga ttagtacaac ttattgtgct ttttatatat tatagatatg "gccgggcc 1020 

111 atgtattggc cttcgtagct taaataaagg catcgaatat tagcctcggt ggt g tatcta 1080 

112 ?catgctg?g ttgt.aa.ct gccaatgttt atgttatcaa acagaaattg gcatgaagtt 1140 

113 tctgtacaag tgttcaataa actgggctat acatgc 

116 <210> SEQ ID NO: 3 

117 <211> LENGTH: 33 

118 <212> TYPE: DNA 

119 <213> ORGANISM: Homo sapiens 

121 <400> SEQUENCE: 3 33 

122 gctgcccaac cagccatggc ggacattgtg atg 

125 <210> SEQ ID NO: 4 

126 <211> LENGTH: 50 

127 <212> TYPE: DNA 

128 <213> ORGANISM: Homo sapiens 
130 <400> SEQUENCE: 4 
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131 gccggagcct ggcttgcacg ctgccgctgg tggagccttt gatcatccag 50 

134 <210> SEQ ID NO: 5 

135 <211> LENGTH: 45 

136 <212> TYPE: DNA 

137 <213> ORGANISM: Homo sapiens 

139 <400> SEQUENCE: 5 45 

140 aagccaggct ccggcgaagg cagcaccaaa ggcgaagtga aggtt 

143 <210> SEQ ID NO: 6 

144 <211> LENGTH: 30 

145 <212> TYPE: DNA 

14 6 <213> ORGANISM: Homo sapiens 

148 <400> SEQUENCE: 6 30 

149 gccaccgcca ccactagttg aggagactgt 

152 <210> SEQ ID NO: 7 

153 <211> LENGTH: 51 

154 <212> TYPE: DNA 

155 <213> ORGANISM: Artificial Sequence 

\H <223> OTHeHnFORMATION: Description of Artificial Sequence: Synthetic 
159 Primer 

161 <400> SEQUENCE: 7 _ 51 

162 ggcggtggct ccgtcatgac ggacattgtg atgacccagt ctcaaaaatt c 

165 <210> SEQ ID NO: 8 

166 <211> LENGTH: 33 

167 <212> TYPE: DNA 

168 <213> ORGANISM: Artificial Sequence 

l£ <IIT> OTHER^NFORMATION: Description, of Artificial Sequence: Synthetic 
172 Primer 

174 <400> SEQUENCE: 8 33 

175 ggtggcggtg gctccggtct agacaccgtg acg 

178 <210> SEQ ID NO: 9 

179 <211> LENGTH: 45 

180 <212> TYPE: DNA 

181 <213> ORGANISM: Artificial Sequence 

III <IIT> OTHER^INFORMATION : Description of Artificial Sequence: Synthetic 
185 Primer 

187 <400> SEQUENCE: 9 + „ 45 

188 aaggctcgtg tcgacctcga gtcattaagc tttaggatct ttatc 

191 <210> SEQ ID NO: 10 

192 <211> LENGTH: 1527 

193 <212> TYPE: DNA 

194 <213> ORGANISM: Artificial Sequence 

IV, tilt OTHER^INFORMATION : Description of Artificial Sequence: Synthetic 

199 <220> FEATURE: 

200 <221> NAME/KEY: CDS 

201 <222> LOCATION: (1)..(1521) 
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203 <400> SEQUENCE : 10 tcc aca tca 48 



203 <400> SEQUENCE: ±u tca 

204 atg acg gac att gtg atg acc cag tct caa aaa ttc. atg ^ 

205 Met Thr Asp He Val Met Thr Gin Ser Gin Lys Phe m ^ 

206 1 ^ flnr atc acc tgc aag gec agt cag aat gtg gat 96 

208 gta gga gac agg gtc age gtc acc tgc g g g ^ ^ ^ ^ 

209 val Gly Asp Arg Val Ser Val Tftr cys uyb ^ 

B S 2 S S 3 S S = 5: S S = K S = S l " 
1 2 2 £ 2 E S 5 S SS £ S S S K S S " 

SS .« „. «, « S « ttc £ =« .<* .tc .,. .« W «• 

221 Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Tnr ^ 

222 65 - J* aaa tat ttc tgt cag caa tat aac age tat 288 

224 cag tct gaa gac ttg gca gag tat ttc tgt g ^ ^ 

225 Gin Ser Glu Asp Leu Ala Glu Tyr Phe L.ys u ^ 

S IS 2 S £ S S ffi S S 2 E S 2 S 2 S 

is 2 2?; s i x 2 s 2 i s s 2 s 2 2 s s " 
1 a s s s = 2 s £ s 2 a = s E 5 2 m 

238 130 <- tat att atc tct gga ttc act ttc ggt aat tac tgg 480 

l\l S £ Leu Ser S - fax Ser Sy Phe Thr P h e Giy Tyr «p 

' £ atg aac tgg gtc cgc cag tct cca gag aag ggg ctt gag tgg att gca 528 
245 Met Asn Trp Val Arg Gin Ser Pro Glu Lys Gly Leu Glu Trp 

Hi C £ 5 2 2 2 2 2 = = S 5 2 S S 2 

1 a 2 s a s s 2 ~ = s s s | s s - ~ 

» S 2 S 2 2 2 £ S S S S £ E S 2 2 m 

258 210 ... oat aac tac gtt ggg cac tat ttt gac cac tgg ggc 720 

260 tgt acc agt tat ggt aac tac gtt ggg 

261 Cys Thr Ser Tyr Gly Asn Tyr Val Gly His Tyr pne y ^ 

262 225 , „tr acc atc tcc tea get age ggt ggc ggt ggc tcc 768 

264 caa ggc acc act etc acc gtc tec tea y y * ger 

265 Gin Gly Thr Thr Leu Thr Val Ser Ser Ala Ser Gly Gly <, y y 

266 245 250 
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268 ggt eta gac acc gtg 

269 Gly Leu Asp Thr Val 

270 260 

272 tac gtg aat ttc ttg 

273 Tyr Val Asn Phe Leu 

274 ' 275 

276 aac age cat gga ate 

277 Asn Ser His Gly He 

278 290 

280 aag tgt ttc gtt ttg 

281 Lys Cys Phe Val Leu 

282 305 

284 gaa ata get ata gat 

285 Glu He Ala He Asp 

286 325 

288 aga aac aga tct tac 

289 Arg Asn Arg Ser Tyr 

290 340 

292 ggc etc ttc aaa aac 

293 Gly Leu Phe Lys Asn 

294 355 

296 tat ccc teg ctg gaa 

297 Tyr Pro Ser Leu Glu 

298 370 

300 ggc att gaa cca tta 

301 Gly He Glu Pro Leu 

302 385 

304 ata gac aat tat aaa 

305 He Asp Asn Tyr Lys 

306 405 

308 att caa atg gtg tct 

309 He Gin Met Val Ser 

310 420 

312 att aga aat aac ttt 

313 He Arg Asn Asn Phe 

314 435 

316 age ctt gag aat aaa 

317 Ser Leu Glu Asn Lys 

318 450 

320 ggt gca aat gga atg 

321 Gly Ala Asn Gly Met 

322 465 

324 ggc aaa aaa tac tat 

325 Gly Lys Lys Tyr Tyr 

326 485 

328 gca etc ttg aag ttc 

329 Ala Leu Leu Lys Phe 

330 500 

333 <210> SEQ ID NO: 11 



age ttt age 
Ser Phe Ser 



aat gag eta 
Asn Glu Leu 
280 

cca ttg ctg 
Pro Leu Leu 
295 

gta gcg ctt 
Val Ala Leu 
310 

gtt aca agt 
Val Thr Ser 

ttc ttt aaa 
Phe Phe Lys 



act aaa 
Thr Lys 
265 

cga gtt 
Arg Val 

cgc aaa 
Arg Lys 

tea aat 
Ser Asn 

gtt tat 
Val Tyr 
330 
gat get 
Asp Ala 
345 



ggt gec act 
Gly Ala Thr 

aaa ttg aaa 
Lys Leu Lys 
285 

aaa tgt gat 
Lys Cys Asp 

300 
gac aat gga 
Asp Asn Gly 
315 

gtg gtg ggc 
val Val Gly 

cca gat get 
Pro Asp Ala 



aca att aaa 
Thr He Lys 
360 

ggt gag aag 
Gly Glu Lys 
375 

agg att ggc 
Arg He Gly 
390 

cca acg gag 
Pro Thr Glu 

gaa gca get 
Glu Ala Ala 



caa cag aga 
Gin Gin Arg 
440 

tgg ggt aaa 
Trp Gly. Lys 

455 
ttt teg gag 
Phe Ser Glu 
470 

gtc acc gca 
Val Thr Ala 

gtc gat aaa 
Val Asp Lys 



aca aga 
Thr Arg 

gca tat 
Ala Tyr 

ate aag 
He Lys 

ata get 
He Ala 
410 
cga ttc 
Arg Phe 
425 

att cgc 
He Arg 



ctt cat ttt 
Leu His Phe 
365 

aga gag aca 
Arg Glu Thr 
380 

aaa ctt gat 
Lys Leu Asp 
395 

agt tct eta 
Ser Ser Leu 

acc ttt att 
Thr Phe He 



tat att acc 
Tyr He Thr 
270 

ccc gaa ggt 
Pro Glu Gly 

gat cct gga 
Asp Pro Gly 

cag ttg gcg 
Gin Leu Ala 
320 

tat caa gta 
Tyr Gin Val 

335 
get tac gaa 
Ala Tyr Glu 
350 

ggc ggc age 
Gly Gly Ser 



etc teg 
Leu Ser 

gca gtt 
Ala Val 

gtt gat 
Val Asp 
490 
gat cct 
Asp Pro 
505 



ccg gcg aat 
Pro Ala Asn 
445 

ttc cag ate 
Phe Gin He 

460 
gaa ttg gaa 
Glu Leu Glu 
475 

caa gta aaa 
Gin Val Lys 

aaa taatga 
Lys 



aca gac ttg 
Thr Asp Leu 

gaa aat gcg 
Glu Asn Ala 
400 

ttg gtt gtt 
Leu Val Val 

415 
gag aac caa 
Glu Asn Gin 
430 

aat aca ate 
Asn Thr He 



egg aca tea 
Arg Thr Ser 

cgt gca aat 
Arg Ala Asn 
480 

ccc aaa ata 
Pro Lys He 
495 



816 
864 
912 
960 
1008 
1056 
1104 
1152 
1200 
1248 
1296 
1344 
1392 
1440 
1488 
1527 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/074,596 



DATE: 03/01/2002 
TIME: 10:54:35 



Input Set : A:\Clfr007.app 

Output Set: N:\CRF3\03012002\J074596.raw 

L:11 M:270 C: Current Application Nu^er ^'^^ ™ t ^ 
L:337 M:258 W: Mandatory Feature missing, <220> FEATURE. 
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